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Abstract. Given an initial placement of a set of rectangles in the plane, 
we consider the problem of finding a disjoint placement of the rectangles 
that minimizes the area of the bounding box and preserves the orthogonal 
order i.e. maintains the sorted ordering of the rectangle centers along 
both x-axis and y -axis with respect to the initial placement. This problem 
is known as Layout Adjustment for Disjoint Rectangles (LADR). It was 
known that LADR is NP-hard, but only heuristics were known for it. We 
show that a certain decision version of LADR is APX-hard, and give a 
constant factor approximation for LADR. 


1 Introduction 

Graphs are often used to visualize relationships between entities in diverse fields 
such as software engineering (e.g. UML diagrams), VLSI (circuit schematics) and 
biology (e.g. biochemical pathways) [13]. For many such applications, treating 
graph nodes as points is insufficient, since each node may have a corresponding 
label explaining its significance. The presence of labels may lead to node over¬ 
lapping. For the typical user, an uncluttered layout is more important than the 
amount of information presented [21]. For complex graphs, it is tedious to create 
meaningful layouts by hand, which has led to algorithms for layout generation. 

Layout generation algorithms typically take a combinatorial description of 
a graph, and return a corresponding layout. Nodes are usually represented by 
boxes, and edges by lines connecting the boxes. For simplicity, the edges of the 
graph are ignored while creating the modified layout. In some interactive sys¬ 
tems, modifications to the graph may happen in multiple stages. The layout must 
be adjusted after each alteration (if new nodes added overlap existing nodes), 
such that the display area is minimized. If we use layout creation algorithms after 
each iteration, we may get a layout that is completely different from the previous 
layout, which may destroy the ‘mental map’ of the user who is interacting with 
the system. Thus, we need an additional constraint in the form of maintaining 
some property of the layout, which would be equivalent to preserving the mental 
map. Eades et al. [6] defined orthogonal ordering as one of the key properties 
that should be maintained in an adjusted layout to preserve the user’s mental 
map. Two layouts of a graph have the same orthogonal ordering if the horizontal 
and vertical ordering of the nodes are identical in both layouts. 

We now state the problem studied in this paper, which involves laying out 
rectangles that represent the nodes in the graph being adjusted. We are given 


a set of rectangles R (each n £ R is defined by an ordered pair, = ( Wi,hi ), 
denoting its width and height respectively) and an initial layout A ln . A layout 
consists of an assignment A : R — > M 2 of coordinates to the centers of rectan¬ 
gles in R. The goal is to find a layout in which no two rectangles intersect and 
orthogonal ordering of the rectangle centers w.r.t A ln is maintained, while mini¬ 
mizing the area of the bounding box of the layout. We refer to this problem as 
Layout Adjustment for Disjoint Rectangles (LADR). Note that R is really a set 
of rectangle dimensions, and not a set of rectangles. Nevertheless, we will refer 
to R as a set of rectangles. See Section 2 for a more leisurely problem statement. 

1.1 Previous Work 

The concept of a mental map was introduced in [6] , along with three quantitative 
models representing it - orthogonal ordering, proximity relations and topology. 
A framework for analyzing the various models of a mental map was presented 
in [4] , which determined that orthogonal ordering constraint was the best metric 
for comparing different drawings of the same graph. A user study designed to 
evaluate human perceptions of similarity amongst two sets of drawings was given 
in [5], in which orthogonal ordering constraints received the highest rankings. 

There has been a lot of work done using the concept of preserving mental 
maps. LADR was first introduced in [18], in which the authors described the 
Force-Scan (FS) algorithm. FS scans for overlapping nodes in both horizontal 
and vertical directions, and separates two intersecting nodes by “forcing” them 
apart along the line connecting the centers of the two nodes, while ensuring that 
the nodes being forced apart do not intersect any additional nodes in the layout. 
In [12], a modification of FS was presented (FS’), which resulted in a more 
compact layout than FS. Another version of FS algorithm, called the Force- 
Transfer (FT) algorithm, was given in [14]. For any two overlapping nodes, 
denote the vertical distance to be moved to remove the overlap as d v , and let 
the horizontal distance for removing overlap be dh■ FT moves the overlapping 
node horizontally if dh < d v , else vertically, and experimentally, it has been 
shown that FT gives a layout of smaller area than FS and FS’. 

FS, FS’ and FT belong to the family of force based layout algorithms. Spring 
based algorithms treat edges as springs obeying Hooke’s Law, and the nodes 
are pushed apart or pulled in iteratively to balance the forces till an equilib¬ 
rium is reached. A spring based algorithm ODNLS , which adjusts the attrac¬ 
tive/repulsive force between two nodes dynamically, is proposed in [16], which 
preserves the orthogonal ordering of the input layout and typically returns a 
smaller overlap-free layout than the force-based family of algorithms. 

It is worth noting that none of the algorithms mentioned above give a prov¬ 
able worst-case guarantee on the quality of the output. 

The hardness of preserving orthogonal constraints w.r.t various optimization 
metrics has also been well-studied. Brandes and Pampel [3] showed that it is 
NP-hard to determine if there exists an orthogonal-order preserving rectilinear 
drawing of a simple path, and extend the result for determination of uniform 
edge-length drawings of simple paths with same constraints. LADR was shown 
to be NP-hard by Hayashi et al. [12], using a reduction from 3SAT. 
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1.2 Related Work 


Algorithms for label placement and packing that do not account for orthogonal 
ordering have been extensively studied. The placement of labels corresponding 
to points on a map is a natural problem that arises in geographic information 
systems (GIS) [22]. In particular, placing labels on maps such that the label 
boundary coincides with the point feature has been a well-studied problem. A 
common objective in such label-placement problems is to maximize the number 
of features labelled, such that the labels are pairwise disjoint. We refer to [2, 15] 
as examples of this line of work. 

Packing rectangles without orthogonality constraints has also been well- 
studied. One such problem is the strip packing problem, in which we want to 
pack a set of rectangles into a strip of given width while minimizing the height of 
the packing. It is known that the strip-packing problem is strongly NP-hard [17]. 
It can be easily seen that if the constraint for orthogonal order preservation is 
removed, then LADR can be reduced to multiple instances of strip packing prob¬ 
lem. There has been extensive work done on strip packing [20, 19, 10], with the 
current best algorithm being a 5/3 + £-approximation by Harren et al. [9]. 

Another related packing problem is the two-dimensional geometric knapsack 
problem, defined as follows. The input consists of a set of weighted rectangles and 
a rectangular knapsack, and the goal is to find a subset of rectangles of maximum 
weight that can be placed in the knapsack such that no two rectangles have an 
overlap. The 2D-knapsack problem is known to be strongly NP-hard even when 
the input consists of a set of unweighted squares [17]. Recently, Adamaszek 
and Wiese [1] gave a quasi-polynomial time (1 + e) approximation scheme for 
this problem, with the assumption that the input consists of quasi-polynomially 
bounded integers. 

1.3 Our results 

We point out an intimate connection between LADR and the problem of hitting 
segments using a minimum number of horizontal and vertical lines. In particular, 
the segments to be hit are the ones connecting each pair of rectangle centers in 
the input layout. The connection to the hitting set is described in Section 3. 
To our knowledge, this connection to hitting sets has not been observed in the 
literature. We exploit the connection to hitting set to prove hardness results 
for LADR in Section 4 that complement the NP-completeness result in [12]. We 
show that it is APX-hard to find a layout that minimizes the perimeter of the 
bounding box. We also show that if there is an approximate decision procedure 
that determines whether there is a layout that fits within a bounding box of 
specified dimensions, then P = NP. These hardness results hold even when the 
input rectangles are unit squares. The results for LADR follow from a hardness 
of approximation result that we show for a hitting set problem. The starting 
point of the latter is the result of Hassin and Megiddo [11] who show that 
it is NP-hard to determine if there is a set of k axis-parallel lines that hit a 
set of horizontal segments of unit length. The added difficulty that we need to 
overcome is that in our case, the set of segments that need to be hit cannot be 
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arbitrarily constructed. Rather, the set consists of all segments induced by a set 
of arbitrarily constructed points. 

It is possible to exploit this connection to hitting sets and use known algo¬ 
rithms for hitting sets (e.g. [8]) to devise an 0(1) approximation algorithm for 
LADR. Instead, we describe (in Section 5) a direct polynomial time algorithm 
for LADR that achieves a 4(l + o(l)) approximation. This is the first polynomial 
time algorithm for LADR with a provable approximation guarantee. The algo¬ 
rithm involves solving a linear-programming relaxation of LADR followed by a 
simple rounding. 

2 Preliminaries 

We define a layout A of a set of rectangles R as an assignment of coordinates to 
the center of each rectangle r £ R i.e. A : R —► R 2 . Our input for LADR consists 
of a set of rectangles R , and an initial layout A ln . We will assume that A ln is 
injective, i.e. no two rectangle centers coincide in the input layout. A rectangle 
r is defined by its horizontal width w r and vertical height h r , both of which are 
assumed to be integral. It is given that all rectangles are axis-parallel in A m , and 
rotation of rectangles is not allowed in any adjusted layout. 

The coordinates of center of r in layout A is denoted by A(r) = ( x r ,y r ). 
For brevity, we denote the x-coordinate of A(r) by A x (r), and the corresponding 
y-coordinate by A y (r). The set of points {A(r) : r £ R} is denoted by A (R). 

A pair of rectangles r,r' £ R is said to intersect in a layout A if and only if 

|Ax(r) - A a (r')| < Wr+ ^ Wr ‘ and |A y (r) - A y (r')| < hr+ ^ hr ' . (l) 

A layout A is termed as a disjoint layout if no two rectangles in R intersect 
with each other. Let IF) (A) and II>(A) denote the ^-coordinates of the left and 
right sides of the smallest axis-parallel rectangle bounding the rectangles of R 
placed by A, respectively. We then define the width of the layout, W(A) = 
W r (X) — Wi (A). Similarly, let H t { A) and -ffb(A) define the ^-coordinates of the 
top and bottom of the bounding rectangle, and the height of the layout is defined 
as H{ A) = H t (X) — -ffb(A). The area of A is thus defined as A(A) = H( A) x W(A). 
The perimeter of A is 2(H(X) + W(A)). 

Let A and X' be two layouts of R. Then, A and A' are defined to have the 
same orthogonal ordering if for any two rectangles r, r' £ R, 


A x (r) < A x (r') 4= 

=* A' x (r) < X' x (r’) 

(2) 

AyW < A y(r') 4= 

=► w < KV) 

( 3 ) 

A x (r) = Ax(r') <*= 

<< 

II 

'C 

( 4 ) 

x y( r ) = x y ( r ') 

4 

II 

( 5 ) 


For any R and corresponding A ln , the minimal area of a layout is defined as: 
A min = inf{A(A) : A is a disjoint layout, A has same orthogonal ordering as A ln } 
It should be noted that it may not be possible to attain a disjoint orthogonality 
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preserving layout whose area is the same as A min - we can only aim to get a 
layout whose area is arbitrarily close to A nun . 

We introduce the concept of rigidities in a layout. A set of rectangles R' C R 
forms a x-rigidity in a layout A if 3a such that R' = {r £ R \ A x {r) = a}. 
We define a y-rigidity in a layout similarly in terms of the y-coordinates of the 
rectangles in that layout. We observe that any rectangle r belongs to a unique 
x-rigidity (and a unique y-rigidity), which may consist of merely itself and no 
other rectangle. We order the x-rigidities in a layout A in increasing order of 21 - 
coordinates, and for any rectangle r £ R, we define its x-rank to be i if r belongs 
to the z-th x-rigidity in this ordering. It is obvious that x-rank of any rectangle 
is an integer between 1 and \R\. We similarly define the y-rank of each rectangle 
in terms of its y-rigidities. Unless otherwise stated, we refer to the rigidities and 
ranks of the initial layout A m whenever these terms are used in the paper. 

Let <f>(p,p') be the segment whose endpoints are points p,p' € P. Then the 
set of segments induced by a set of points P is defined as #(P) = {(f{p,p') : 
p,p' £ P, p ^ p'}, denoted by <P when P is clear from the context. 

We also consider a simpler version of LADR where the set of rectangles 
R consists of unit squares. We call this version as the Layout Adjustment for 
Disjoint Squares problem, and refer to it as LADS for brevity. 

3 Reduction of LADS to Hitting Set 

We formally define a unit grid as follows. Let / : R 2 —► Z 2 be the function 
f(x,y) = (|_x_|, (yj)- The function / induces a partition of 1R 2 into grid cells - 
grid cell (i,j) is the set {p £ K 2 f(p) = ( i,j )}. We call this partition a unit 
grid on M 2 . The ‘grid lines’ are the vertical lines x = a and y = a for integer a. 

Let S be the set of unit squares provided as input to LADS, having initial 
layout A ln . Consider a disjoint, orthogonal order preserving layout A for S . Let L 
be the subset consisting of those grid lines that intersect the minimum bounding 
box of A(S). Let <j> be the line segment connecting the points A(s) and A(s'), for 
some s, s' £ S. Since the layout A is disjoint, A(s) and A(s') lie in different grid 
cells. Thus, there exists at least one line r £ L that intersects <j> . Motivated by 
this, we define a hitting set problem as follows. 

We say a line r hits a line segment 0 if r intersects the relative interior of <f> 
but not either end point of (f>. Thus, if <j> is a horizontal line segment (which would 
happen if s, s' belongs to a y-rigidity), then <f> cannot be hit by a horizontal line 
t £ L. We thus define the Uniform Hitting Set (UHS) problem as follows: 

Definition 1 (Uniform Hitting Set - Decision Problem). Given a set of 
segments <P induced by a point set P and a non-negative integer k, is there a set 
of axis-parallel lines L that hit all segments in <L, such that \L\ < fc? 

Since the area of the minimum bounding box for A(S') is roughly the product 
of the number of horizontal grid lines intersecting it and the number of vertical 
grid lines intersecting it, we also need the following variant. 

Definition 2 (Constrained Uniform Hitting Set - Decision Problem). 

Given a set of line segments, <?, induced by a set of points P , and non negative 
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integers r, c, is it possible to hit all segments in <P with a set of lines L containing 
at most r horizontal lines and c vertical lines ? 

The term ‘uniform’ in the problem name refers to the fact that each segment 
in <P needs to be hit only once by a horizontal or vertical line. We denote the 
problem thus defined as CUHS, and proceed to show its equivalence with a 
constrained version of the layout adjustment problem. 

Definition 3 (Constrained LADS - Decision Problem). Given 
squares S, initial layout A ln , positive integers w, h and a constant 0 < 
is there a layout A' having height H(X') < h + e and width W{ A') < 
satisfying the following conditions? 

1. A' is a disjoint layout. 

2. A ln and A' have the same orthogonal order. 

We term the constrained version of layout adjustment problem as CLADS. 
We now show how to reduce a given instance of CLADS into an instance of 
CUHS. We define as the set of all line segments induced by points in A m (S'). 

Lemma 1. If there is a set of lines L containing at most r horizontal lines and 
at most c vertical lines that hit all segments in <1, then there is a disjoint layout 
A' that has the same orthogonality as X m and whose height and width is bounded 
by h + e and w + e, for any e > 0. Here h = r + 1, w = c + 1. 

To solve LADS by multiple iterations of a procedure for solving CUHS, it 
would be useful to guess the width of a disjoint layout with near-optimal area. 
The following observation allows us to restrict our attention to layouts with 
near integral width. That makes it possible to discretize LADS, by solving a 
constrained version of LADS for all values of widths in {1,2,...,|5|}. 

Lemma 2. Any disjoint layout X can be modified into a disjoint layout X' having 
the same height and orthogonal ordering as X, such that 1U(A')(< W(A)) lies in 
the interval [u>, w + e], where w £ {1,2,..., n.} and e > 0 is an arbitrarily small 
constant. 

We can similarly modify a disjoint layout A into an orthogonal order preserv¬ 
ing disjoint layout A' which has the same width, and whose height lies in the 
interval [h, h + e] for some integer h > 0. Thus, combining the two methods, we 
obtain the following corollary: 

Corollary 3. Any disjoint layout X can be modified into an orthogonal order pre¬ 
serving disjoint layout X', such that W(X')(< W(A)) lies in the interval \w,w-\-e\ 
and H(X')(< H{ A)) lies in the interval [h, h + e\, where w,h £ {1,2,..., n} and 
e > 0 is an arbitrarily small constant. 

Lemma 4. For any e < 1/2, if there is a disjoint layout X' that has the same 
orthogonality as X m and whose height and width is bounded by h + e and w + e 
respectively, where h, w are positive integers, then there is a set of lines L that 
hit all segments in F, containing at most c vertical lines and r horizontal lines. 
Here r = h — 1, c = w — 1. 


n unit 

£ < 1 , 
W + £, 
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All proofs of lemmas in this section are in Appendix A.l. Lemmas 1 and 4 
and Corollary 3 show the close connection between CLADS and CUHS. In subse¬ 
quent sections, we exploit this connection to derive hardness results for CLADS. 


4 Inapproximability of Layout Adjustment Problems 

In this section, we prove APX-hardness of various layout adjustment problems. 
We consider a variant of LADS where instead of minimizing the area, we would 
like to minimize the perimeter of the output layout. We prove an inapprox¬ 
imability result for this problem which readily follows from APX-hardness of 
the Uniform Hitting Set problem. We also show that the decision problem Con¬ 
strained LADS (CLADS) is NP-hard. Recall that in this problem, given an initial 
layout of n unit squares, positive integers w, h, and a constant e > 0 , the goal is 
to determine if there is an orthogonal order preserving layout having height and 
width at most h + e and w + e respectively. To be precise we show a more general 
inapproximability result for this problem. We prove that, given an instance of 
CLADS, it is NP-hard to determine whether there is an output layout of height 
and width at most h + e and w + e respectively, or there is no output layout 
of respective height and width at most (1 + £)(/i + e) and (1 + £)(w + e) for 
some 0 < £ < 1. This result follows from the connection of CLADS with Con¬ 
strained Uniform Hitting Set (CUHS) described in Section 3 and APX-hardness 
of CUHS. The APX-hardness of CUHS follows from the APX-hardness of UHS, 
to which we turn next. 

APX-Hardness of Hitting Set Problem. We consider the optimization version of 
UHS, in which given a set of points P, the goal is to find minimum number of 
vertical and horizontal lines that hit all segments in ^(P). In this section, we 
prove that there is no polynomial time (1 + £)-factor approximation algorithm 
for UHS, unless P = NP, for some 0 < £ < 1. Note that the UHS problem we 
consider here is a special case of the hitting set problem where, given any set 
of segments S, the goal is to find a hitting set for S. This problem is known to 
be NP-hard. But, in case of UHS, given a set of points, we need to hit all the 
segments induced by the points. Thus the nontriviality in our result is to show 
that even this special case of hitting set is not only NP-hard, but also hard to 
approximate. To prove the result we reduce a version of maximum satisfiability 
problem (5-OCC-MAX-3SAT) to UHS. 5-OCC-MAX-3SAT is defined as follows. 
Given a set X of n boolean variables and a conjunction <f> of m clauses such that 
each clause contains precisely three distinct literals and each variable is contained 
in exactly five clauses (m = pp), the goal is to find a binary assignment of the 
variables in X so that the maximum number of clauses of <p are satisfied. The 
following theorem follows from the work of Feige [7]. 

Theorem 5. For some 7 > 0, it is NP -hard to distinguish between an instance 
of 5-OCC-MAX-3SAT consisting of all satisfiable clauses, and one in which less 
than (1 — 7 )- fraction of the clauses can be satisfied. 
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The crux of the hardness result is to show the existence of a reduction from 
5-OCC-MAX-3SAT to UHS having the following properties: 

1. Any instance of 5-OCC-MAX-3SAT in which all the clauses can be satisfied, 
is reduced to an instance of UHS in which the line segments in $(P) can be 
hit using at most k lines, where k is a function of m and n. 

2. Any instance of 5-OCC-MAX-3SAT in which less than 1 — 5 (for 0 < S < 1) 
fraction of the clauses can be satisfied, is reduced to an instance of UHS in 
which more than (1 + -^5)k lines are needed to hit the segments in ’P(P). 

The complete reduction appears in Appendix A.3. The next theorem follows 
from the existence of such a reduction and from Theorem 5. 

Theorem 6. There is no polynomial time (1 + t;)-factor approximation algo¬ 
rithm for UHS with £ < Jg 7 , unless P = NP, 7 being the constant in Theorem 5. 

Now we consider the variant of LADS where we would like to minimize the 
perimeter 2(w + v) of the output layout, where w and v are the width and height 
of the layout respectively. We refer to this problem as Layout Adjustment for 
Disjoint Squares - Minimum Perimeter (LADS-MP). We note that in UHS we 
minimize the sum of the number of horizontal and vertical lines {k = r + c). Thus 
by Lemma 1 and Lemma 4 it follows that a solution for UHS gives a solution 
for LADS-MP (within an additive constant) and vice versa. Hence the following 
theorem easily follows from Theorem 6. 

Theorem 7. No polynomial time (1 + ^') -factor approximation algorithm exists 
for LADS-MP with unless P = NP, £ being the constant in Theorem 6. 

Inapproximability of CUHS. We show that if there is a polynomial time approxi¬ 
mate decision algorithm for Constrained Uniform Hitting Set - Decision Problem 
(CUHS), then P = NP. We use the inapproximability result of UHS for this pur¬ 
pose. See Definition 2 for the definition of CUHS. Now we have the following 
theorem whose proof follows from Theorem 6 and is given in Appendix A.3.1. 

Theorem 8. Suppose there is a polynomial time algorithm that, given <L(P) and 
non-negative integers r,c as input to CUHS, 

(1) outputs ‘‘yes”, if there is a set with at most c vertical and r horizontal lines 
that hits the segments in d>(P); and 

(2) outputs “no”, if there is no hitting set for <L>(P) using at most (l+£)c vertical 
and (1 + £)r horizontal lines, where f is the constant in Theorem 6. 

Then P = NP. 

Inapproximability of CLADS. We show that the existence of a polynomial time 
approximate decision algorithm for CLADS implies P = NP. See Definition 3 
for the definition of CLADS. Now we have the following theorem whose proof 
follows from Theorem 8 and is given in Appendix A.3.2. 


Theorem 9. Suppose there is a polynomial time algorithm that, given S , X m , 
w,h, and £ as input to CLADS, 

(1) outputs “yes”, if there is an output layout X' with H(X’) < h+e and W (A') < 
w + e; and 

(2) outputs “no”, if there is no output layout X 1 with H(X') < (l + £')(/i + e) and 
W(X') < (1 + C){ w + £ )t where £' = | and £ is the constant in Theorem 6. 

Then P = NP. 

5 Approximation Algorithm 

In this section, we describe an approximation algorithm for LADR i.e. for a 
set R of axis-parallel rectangles having initial layout A ln , we need to find a 
disjoint layout of minimum area that preserves the orthogonal ordering of A ln . 
Let Wmax = max{w r | r £ i?} and 7L max = ma x{h r \ r £ R} be the maximum 
width and maximum height, respectively, amongst all rectangles in R. Lemma 2 
showed that if the input consists of a set of squares S, any disjoint layout of S 
can be modified into a disjoint layout having same orthogonality such that its 
width is arbitrarily close to an integer from the set {1,..., |Sj}. It can be seen 
that Lemma 2 can be extended in a straightforward manner for a set of axis- 
parallel rectangles R i.e. any disjoint layout of R can be modified into a disjoint 
orthogonal-order preserving layout having a width that is arbitrarily close to 
an integer from the set {Wmax, Wm ax + 1,..., Wr}, where Wr = w r . We 

r£R 

henceforth state Corollary 3 in the context of LADR as follows. 

Corollary 10. Let Wr = w r and Hr = h r be the sum of the widths 

r£R r£R 

and sum of the heights of all the rectangles in R, respectively. Then, any disjoint 
layout X of R can be modified into an orthogonal order preserving layout X' of R, 
such thatW(X')(< W(A)) lies in the interval \w,w+e\ and H(X'){< H( A)) lies in 
the interval \h, h Ac], where w £ {lUmax? WmaxPU - ■ - -11/?}- h £ { H max • H maK -\- 
1,..., Hr} and e > 0 is an arbitrarily small constant. 

Using Corollary 10, we know that for any disjoint layout A of R, there is a 
corresponding disjoint layout A' having the same orthogonal order as A', whose 
height and width are arbitrarily close to an integer from a known set of integers. 
Hence, we look at all disjoint orthogonality preserving layouts in that range, and 
choose the one with the minimum area as our solution. 

Given positive integers w £ {Umax, Wmax + 1, • • •, Wr}, h £ {ff max , Hmax + 
1, ..., Hr} and 0 < e < 1, we formulate as a LP the problem of whether there 
is an orthogonal order preserving layout A with W(A) < w + £, H{ A) < h + e. 
Recall that a layout A assigns a location A(r) = (x r ,y r ) for the center of each 
rectangle r £ R. The variables of our linear program are U r ^R{x r ,y r }. For any 
two rectangles r,r' £ R, X ™(r) < A ™{r') implies that x r < x' r . We add such a 
constraint for each pair of rectangles in R, both for x-coordinate and y-coordinate 
of the layout. Similarly, we add the constraint x r = x' r for all pair of rectangles 
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r,r' £ R for which A“(r) = A™(r'). These constraints ensure orthogonality is 
preserved in the output layout. 

We now look at constraints that ensure disjointness of the output layout. Let 
r and r’ be two rectangles in the initial layout A ln , having dimensions ( w r , h r ) and 
(w r ',h r >) respectively. We define w(r,r') = Wr + W r' an d h(r,r') = hr+ ^' . Let 


®diff(r, r') 


Xr Xj' 1 •, 

Xf 1 Xf 5 


if A“(r / ) < A“(r) 
otherwise 


. We define y&ui{r,r') analogously. 


If r,r' are disjoint in some layout, then either their ^-projections or their y- 
projections are disjoint in that layout. Equivalently, either the difference in x- 
coordinates of the centers of rectangles r, r' is at least w(r,r '), or the difference 
in y-coordinates of the centers is at least h(r,r'). We thus get the following LP: 




x r 

< 

X r f 

Vr, r' 

£ 

R : 

A“l 

[r) 

< 

AIV) 

( 6 ) 



x r 

= 

x r > 

Vr, r' 

£ 

R : 

A“l 

[r) 

= 

K{r') 

(7) 



Vr 

< 

Vr’ 

Vr, r' 

£ 

R : 

A“! 

[r) 

< 


( 8 ) 



Vr 

= 

Vr' 

Vr, r' 

£ 

R : 

A“! 

[r) 

= 

K(r') 

(9) 

+ T") 

{^Cr 

W r \ 

2 ) 

< 

W + £ 

Vr, r' 

£ 

R : 

:A“I 

[r) 

< 

K(r') 

(10) 


- (yr - 

H r \ 
" 2 ) 

< 

h -\- £ 

Vr, r' 

£ 

R : 

Ay 11 ! 

j) 

< 

K( r ') 

(11) 

£dif f (r, r') 
w(r , r') 

Vdiff ( 

h{r, 

r, r') 
r') 

> 

i 





Vr, 

r £ R 

(12) 


Inequalities ( 6 ) to (9) model the orthogonal ordering requirement for a lay¬ 
out, while Inequalities (10) to (11) restrict the width and height of the layout 
respectively. Since any two rectangles r, r' in a disjoint layout are separated by 
at least half the sum of their widths in the x-direction (w(r, r')) or at least half 
the sum of their heights in the y-direction (h(r, r')), Inequality (12) ensures that 
every such layout is a valid solution for the linear program. We incorporate the 
linear program into Algorithm 1 for solving LADR. 


Algorithm 1 ApproxLADR{R 1 A ln ) 

Input: A set of rectangles R, and initial layout A ln . 

Output: A disjoint layout that has the same orthogonal order as A m . 

1: for w = Umax to Wr do 
2: for h = Umax to Hr do 

3: if LP stated in Inequalities (6) to (12) is feasible then 

4: A Wi h ■<— Layout returned by solution of LP. 

5: if A mln is undefined or A(\ W} h) < A(A mln ) then 

6: A min A„, fc 

7: Define A (A) = 2 ■ A mi n (R) i.e. A(r) = (2 * A“ in (r), 2 * A” in (r)) , Vr € R 
8: return The layout A. 
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Lemma 11. ApproxLADR(R, X ln ) returns a A-approximation for LADR. 


Proof. Let A Wt h be any feasible layout returned by the LP in Line 4, for some 
value of w , h. Let r, r' be two rectangles in R , and assume that A“(r) > A“(r'), 
A l y(r) > A™(r'). (The other cases are symmetric). By Inequality (12), either 

Xi w(r r’q ^ — I or ^"r rq ^ Without loss of generality, assume its the for¬ 

mer. Consider the layout A = 2\ Wj h, as in Line 7. Hence, our assumption that 


Xi w(r r') ^ — 5 implies that A x (r) — A x (r') = 2x r — 2x r > > w(r,r'), which satisfies 
the criteria for disjointness in Inequality (1). Since the final layout A returned by 
the algorithm equals 2 • A w > t h' for some feasible layout \ w \h', A is a disjoint lay¬ 
out that also satisfies the constraints for orthogonal ordering in Inequalities (6) 
to (9). 

Let A* be any disjoint layout preserving the orthogonal ordering of A ln . We 
may assume, by Corollary 10, that its width is at most w' + e and its height 
is at most h! + e, for some integers w' £ {W^max^max + 1,...,Wr}, h! £ 
{H max ,Hinax + 1 ,Hr} and some e > 0. Consider the iteration of the inner 
for loop in Algorithm 1 with w = w' and h = h'. Since A* is a valid solution for 
the LP, the layout A Wj h computed in Line 4 (and hence A min ) has an area that 
is less than or equal to that of A*. The algorithm Algorithm 1 returns a layout 
A (S) obtained by multiplying each of the coordinates in A mm by a factor of 2. 
Hence, the layout A (S) has at most twice the width and at most twice the height 
of A*, ensuring that A(A) < 4 * A(A*). □ 


We note that since Wr. Hr are not polynomial in the input size, the resultant 
algorithm is a pseudo-polynomial time algorithm. But by searching across expo¬ 
nentially increasing value of widths, and thereby losing a small approximation 
factor, we can obtain a 4(1 + o(l)) polynomial time approximation for LADR. 
We conclude by summarizing our result as follows: 


Theorem 12. There is a polynomial time algorithm that returns a 4(1 + o(l))- 
approximation for LADR i.e. given a set of rectangles R and an initial layout 
A m , it returns an orthogonal order preserving disjoint layout whose area is at 
most 4(1 + o(l)) times the area attainable by any such layout. 

We note that our approach can also be used to get a 2(l+o(l)) approximation 
for the problem of finding a layout of rectangles that minimizes the perimeter. 
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A Appendix 


A.l Proofs for Section 3 


Lemma 1. If there is a set of lines L containing at most r horizontal lines and 
at most c vertical lines that hit all segments in <1, then there is a disjoint layout 
X 1 that has the same orthogonality as X m and whose height and width is bounded 
by h + e and w + e, for any e > 0. Here h = r + 1, w = c + 1. 

Proof. For any p £ P, we define p{p) : P Z 2 as p(p) = ( i,j ) where i is the 
number of vertical lines in L to the left of p , and j is the number of horizontal 
lines in L below p. We note that the function p : P —► Z 2 is one-to-one, as 
otherwise, the segment corresponding to two points p and p' with p(p) = p(p') 
would not be hit by any line in L. For convenience, if p(p) = ( i,j ), we denote 
Px(p) = i and p y {p) = j. 

We first consider the placement A (p) = (i + j + \), where i = p x (p) and 
j = p y {p). This ensures disjointness, but not orthogonality - within a single 
column (or row) in A, there could be orthogonality violations w.r.t A m due to all 
endpoints having the same x (or y) coordinate. This is fixed as follows. 

We define P x {i ) = {p £ P \ p x (p) = *}, which represents the i-th column 
of the hitting set. We similarly define P y (j) = {p £ P \ p y {p) = j}- Let afp) 
denote the x-rank of p within P x (i ), and similarly let @j(p) denote the y-rank 
of p within P y (j). For some S > 0, we define the layout A' for a point p with 
Pip) = (hj) as: 

A / (p) = («(l + 5) + i + *-^, j(i + S) + ^+8-^j (13) 


We observe that the maximum value of cti(p) (or /3j(p )) is n. Thus, for any 2 
points p,p' in consecutive columns i.e. p £ P x (i),p' £ P x {i+ 1), \\' x (p) — X' x (jp')\ > 
1, making the corresponding unit squares disjoint. Any 2 points in consecutive 
rows are similarly placed more than unit distance apart, which establishes the 
disjointness of the layout. 

We now show that A ln and A' have the same orthogonal ordering. Let p,p' £ P 
be two unit squares and assume A“(p) < X 1 x (p'). If p x (p) < p x {p'), it is clear 
that X' x (p) < X' x (p') as desired. If p x (p) = p x (p') = i, then it is easy to see that 
AL n b) < 4 n (p') => oti(p) < ai{p') =» X' x (p) < A ' x (p') and A“(p) = X i x (p') => 
oti{p) = oti{p') A x (p) = A x (p')- Similar reasoning applies to the y-coordinates. 

Since L has at most c vertical lines (c < n — 1), for any p,p' £ P 1 |A x (p) — 
X' x {pt)\ < c(l + 6) + S = c + (c + 1)<5. Since this is the maximum difference of 
^-coordinates between any two centers of unit squares in the layout, IT (A') < 
c+l + (c+l)<5 = w+(c+l)<5. Putting the value of <5 = we have IT(A') < w+e. 
Similarly, we can show that H( A') < h + e, thus concluding the proof. □ 


Lemma 2. Any disjoint layout A can be modified into a disjoint layout X' having 
the same height and orthogonal ordering as A, such that IT(A , )(< IT(A)) lies in 
the interval [w, w + e], where w £ {1,2,... ,n} and e > 0 is an arbitrarily small 
constant. 
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Proof. Let s, denote the set of squares in A with cc-rank i, and let S > 0 be 
a parameter. We create a modified layout A' by “compressing” the layout A 
horizontally while keeping the y-coordinates untouched. We set A^.(si) = 1, 
and assign the ^-coordinates of the remaining squares in ascending order of 
their axranks in A. We claim that it is possible to place squares in S; such that 
A ' x (si) £ [b, b + (i — 1) • <5], where 0<K«-le Z + U {0}. 

For the base case, we place the squares in s 2 . If the projection of s 2 on the 
y-axis is not disjoint from that of Si, then set A^(s 2 ) = A^Si) + 1- H not, set 
•^x( s 2) = A^.(si) + S. In both cases, si and s 2 do not satisfy Inequality (1) in 
layout A', and hence are disjoint in A'. 

Assume inductively that for j < i — 1, we have defined A r x (sj) such that 
A r x (sj) £ [6, b + (j — 1) • 5] for some 0 < b < j — 1. We now place s*. Set 
A^,(si) = 00 initially. Consider the ^-projections of Sj and s^_ 1 in A. If they are 
not disjoint, then set A r x (si) = A^(si_i) + 1. If not, Si can be translated to the left 
till either Af^S;) = A^.(sj_i) + <$ or it touches some square in sp, i’ < i, whichever 
happens first. In the former case, A ' x (si) £ [b, b + (i — 1) • (5] and b < i — 1. In the 
latter case, sy satisfies the induction hypothesis and hence A^(s,)(= X' x (si>) + 1) 
is in the interval [b' + 1, b' + 1 + (*' — 1) • 5], where b' < i' — 1 < * — 1. Hence, in 
all cases, X x {si) satisfies the induction claim, and hence we claim it is true for 
all i < n. 

It remains to bound the total width of the layout A'. Since A^,(si) = 1 and 
A ' x (si) < b + i ■ 6 for some integer b < i — 1, it follows that the width of the layout 
is in [b,b + n ■ S] for some integer b. Setting 6 = ^ concludes the proof. □ 

Lemma 4. For any e < 1/2, if there is a disjoint layout X that has the same 
orthogonality as A m and whose height and width is bounded by h + e and w + e 
respectively, where h, w are positive integers, then there is a set of lines L that 
hit all segments in <b>, containing at most c vertical lines and r horizontal lines. 
Here r = h — 1, c = w — 1. 

Proof. Assume without loss of generality that X is contained in a rectangle 
[0, w + e] x [0, h + e] in the plane. We consider the following set of lines: 

x = i+^+5, for i = 1, 2,..., w— 1 and y = j+^+S, for j = 1, 2,..., h— 1. 

We pick a <5 < ^ so that none of these lines contain a rectangle center. We now 
show that the resultant set of lines L constitute a hitting set for d>. 

Let (j> £ d> be an arbitrary line segment, whose endpoints p,p' are the centers 
of unit squares s, s' in the disjoint layout A'. Since s, s' do not intersect, then 
either |A^.(s) — A^,(s')| > 1 or |A' y (s) — A' y (s')| > 1. We assume without loss of 
generality its the former. We know that 0 < A^.(s), A^(s') < w + e and none 
of the vertical lines in L pass through p,p'. Since successive vertical lines are 
unit distance apart, there must be at least one vertical line x = ^ + S that lies 
between p,p'. We can argue similarly if \X y (s) — A/(s')| > 1 using the set of 
horizontal lines in L. Thus, d> is hit by at least one line in L, making the latter a 
hitting set consisting of at most c = w — 1 vertical lines and r = h — 1 horizontal 
lines. □ 
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A.2 Proof of Theorem 6 


Theorem 6. There is no polynomial time (1 + £)-factor approximation algo¬ 
rithm for UHS with f unless P = NP, 7 being the constant in Theorem 5. 

Proof. Suppose there is a polynomial time (l+£)-factor approximation algorithm 
for UHS for some £ < ^ 7 . Now consider the reduction 7 r. For the instance of 
UHS obtained from an instance of 5-OCC-MAX-3SAT in which all the clauses are 
satisfied, there is a hitting set of size at most k ; so the approximation algorithm 
finds a hitting set of size at most (1 + £)k. For any instance obtained from an 
instance of 5-OCC-MAX-3SAT in which less than 1 —7 fraction of the clauses are 
satisfied, needs more than (1 + P 7 )k > (1 + £)fc lines. Thus using this algorithm 
we can distinguish between an instance of 5-OCC-MAX-3SAT consisting of the 
clauses all of which can be satisfied, and one in which less than 1 — 7 fraction 
of the clauses can be satisfied. Hence from Theorem 5 it follows that P = NP, 
which completes the proof of this theorem. □ 

A.3 Reduction of 5-OCC-MAX-3SAT into Hitting Set Problem 

A set L of horizontal and vertical lines is said to separate a set P of points, if 
for each point p £ P, there is a 2-dimensional cell (possibly unbounded) in the 
arrangement of the lines in L that contains p and no other points of P. 

Recall that for any set of points P, <P(P) is the set of segments induced by P. 
Suppose a set L of horizontal and vertical lines hits all the segments of <£(P). By 
definition a line hits a segment, if it passes through the interior of the segment, 
but does not intersect either endpoint of the segment. Thus if we perturb each 
line in L that passes through a segment endpoint, then no line in L intersect 
any segment endpoint and L still remains a hitting set for ^(P). Henceforth, 
by a hitting set of lines L for ‘T(P) we mean the lines in L hit the segments in 
#(P), but no line in L intersects any point of P. Now our claim is that if L is 
a hitting set for ^(P), L separates P. If not, then there exists a cell containing 
two points, and the corresponding segment is not being hit by any line, which 
cannot be true. Conversely, if L separates the points of P, then all the segments 
of <2>(P) are being hit. The notion of separation will help us to simplify our 
arguments for bounding the minimum number of lines required to hit all the 
segments induced by a point set. Thus from now onwards, we use the notion of 
hitting and separation interchangeably. 

For any set of segments S and a hitting set of lines L for S' D S, consider the 
subset of L consisting of every line that hits at least one segment of L; denote its 
cardinality by Nl(S). For any set of segments S, denote the minimum number 
of lines needed to hit all segments of S by N(S). Given a set of lines L, a point 
set Pi is said to be separated from another point set P 2 , if each segment (p,p') 
is hit by some line in L, where p £ P\,p' € P 2 . For any two point sets P\ and 
P 2 , denote the minimum number of lines needed to separate Pl from P 2 by 

N(p ll p 2 ). 

For any set P of points, define x-span of P to be the interval [x m i n , £ max ] 
on real line, where x max and a; m in are the maximum and minimum among the 
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^-coordinates of the points of P. Similarly, define y-span of P corresponding to 
the y-coordinates of its points. Now we have the following lemma. 

Lemma 13. Suppose Pi,..., Pi are point sets with pairwise disjoint x-spans 
and pairwise disjoint y-spans. Then for any hitting set of lines L for a set of 
segments S 2U‘=i 

l l 

N L (\j<P(p l )) = Y / N L mm 

i=1 i—1 

Proof. Let Li be the subset of L consisting of every line that hits at least one 
segment in for i = 1,..., l. Note that it is sufficient to show that for any 

i, j £ { 1 ,. .., 1} such that i ^ j, Li D Lj = cj>. Consider any horizontal line y = a 
of Li. a must lie in y-span of P\. As y-spans of Pi and Pj are disjoint, this line 
cannot hit any segment of <P(Pj), and thus cannot belong to Lj. Similarly, any 
horizontal line of Lj cannot hit any segment of Pi ), and thus cannot belong 
to Li. Now consider any vertical line x = b of Li. b must lie in x-span of Pi. As 
x-spans of Pi and Pj are disjoint this line cannot hit any segment of <L{Pj), and 
thus cannot belong to Lj . In a similar way, any vertical line of Lj cannot hit any 
segment of ^{Pi), and thus cannot belong to Li. Hence Li n Lj = <f>. □ 



Fig. 1: (a) Optimal configurations of hitting set for variable (xi) points. The top 
one corresponds to Xi = 0 and the bottom one corresponds to Xj = 1. (b) The 
points corresponding to the clause X 3 VX 2 V 2 T 1 . The hitting set for variable gadgets 
corresponds to the assignment X\ = 1 ,X 2 = 0 ,X 3 = 1 


Now we discuss the construction of an instance of UHS from a given instance 
of 5-OCC-MAX-3SAT. We construct a gadget that consists of three classes of 
points - variable points, clause points and auxilliary points. 

Variable Points For each variable we take a translate of the six points in 
I = {(1,4), (2, 2), (3, 6 ), (4,1), (5, 5), ( 6 ,3)}. We note that three lines are nec¬ 
essary and sufficient to separate these six points. To be precise there are two 
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optimal choices one involving a horizontal line y = a, where 4 < a < 5 and the 
other involving a horizontal line y = b, where 3 < b < 4. See Figure la for an 
illustration. A set of points U' is called (p, g)-translate of a point set U , if U' is 
obtained by translating the x and y-coordinates of the points in U by p and q 
respectively. For each variable awe take a set V. which is ((6 (i — 1), 6 (* — 1))- 
translate of I, where 1 < i < n. Let V = U" = 1 Vj;. 

We note that the x-spans (resp. y-spans) of the sets V t for 1 < i < n are 
pairwise disjoint. With respect to V for 1 < i < n, let us call a horizontal line a 
Type 0 (resp. Type 1) line, if it has the form y = b for 6(i—1)+3 < b < 6(i — 1)+4 
(resp. y = a for 6(i — 1) + 4 < a < 6(i — 1) + 5). Now we have the following 
observation. 

Observation 4. Three lines are necessary and sufficient to hit all the segments 
induced by V, i.e N(<P(V)) = 3, where 1 < i < n. In addition, there are exactly 
two types of optimal hitting sets for ^{Vf) - one that uses a Type 1 line, but no 
Type 0 line, and the other that uses a Type 0 line, but no Type 1 line. 

We will associate the first choice of hitting set for <P(V.) with the assignment 
Xi = 1 and the second choice with Xi = 0. Now we have the following lemma. 

Lemma 14. 4n — 1 lines are sufficient to hit all segments of4>(V), where the 
segments of ^(Vf) are being hit using 3 lines for 1 < i < n. 

Proof. Note that separating V involves separating V t from V i+1 for each 1 < 
i < n — 1, and separating points of Vi for each 1 < i < n. By Observation 4 
three lines are sufficient to separate the points of V- Also N(V,V+i) = 1, as 
any vertical or horizontal line between V. and Vj+i separates one from the other. 
Thus in total 3?r + n — 1 = 4n — 1 lines are sufficient. □ 

Given any hitting set for ^(V) with at most 4n — 1 lines we want to con¬ 
struct an assignment of the boolean formula f>. To be precise we want to ensure 
that always 3 lines are used to separate V(, if at most 4n — 1 lines are used to 
separate V. This ensures that one of the two configurations of 3 lines are used 
to separate V.- Then we can assign binary values to the variables depending on 
the configuration used. Figure 2 shows an example where 4n — 1 lines are used 
to separate V, but 4 lines are used to separate one of the V t . To ensure exactly 
3 lines are used to separate each V, , we can make sure that n — 1 lines between 
V. and V + i are always picked up, for 1 < i < n — 1 . Then as 3 lines are needed 
to separate each V., exactly 3 lines are used for each Vi, if 4n — 1 lines are used 
in total. Later we show that a subset of the auxilliary points will enforce this 
constraint. 

Clause Points For each clause Cj we take a set Tj of ten points that is a union 
of two sets Tj and Tj. Tj consists of a translate of the four points in J = 
{(2, -1), (5, -2), ( 6 , -3), (9, -4)}. To be precise Tj is ((6 n + 1 + 10(j - 1), 4(j - 
m))-translate of J. In Figure 3 p,q,r and s are the points of Tj corresponding 
to the clause £3 V X 2 V X\. The points in Tj depend on the literals of Cj. Let 
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Fig. 2: 11 lines are used to separate V, but 4 lines are used to separate V 2 


u± , U2 and U3 are the literals of Cj. For each literal we take two points. The 
^-coordinates of the points corresponding to u\,U 2 and U 3 are {6 n + 1 + 10 (j — 
1) +1, 6 n +1 +10 (j - 1) + 3}, {6n +1 +10 (j - 1) + 4,6n +1 + 10(j - 1) + 7} and 
{6?r + 1 + 10 (j — 1) + 8,6n + 1 + 10(j — 1) + 10} respectively. The y-coordinates 
of these points depend on the form of the literals. Let ut is corresponding to 
the variable a,’; for t £ {1,2,3}. If u t = athe ^-coordinates of the two points 
corresponding to ut are 6(z — 1) -(- 5 — ej and 6 (i — 1) + 4 + ej respectively, where 
e is a very small positive number. Otherwise, Ut = Xi and the y-coordinates are 
6(i— l) + 4— ej and 6 (i— l) + 3 + ej respectively. Let T 1 = UjL-^Tj, T 2 = U jLiTj 
and T = T 1 UT 2 . 

In Figure 3 the points corresponding to the literals 3:3, X 2 and X\ are {a, &}, 
{c, d} and {e,/} respectively. Consider the points of Tj in decreasing order of 
y-coordinates. We divide the points in this order into groups of two. Thus the 
first three groups are corresponding to T 2 and the remaining two groups are 
corresponding to Tj. We denote the first three groups by T 2 S for 1 < s < 3, and 
the remaining two groups by Tj t for 1 < t < 2. We note that the y-spans of all 
these five groups are pairwise disjoint. Also a;-spans of the three groups of Tj are 
pairwise disjoint. Similarly, a:-spans of the two groups of Tj are pairwise disjoint. 
But the a;-spans of Tj 1 and Tj 1 , and Tj 1 and Tj 2 have nonempty intersection. 
Thus it is possible to separate the points in Tj 1 as well as T 2 l (resp. Tj l as well 
as Tj 2 ) using a single vertical line. Similarly, it is possible to separate the points 
in Tj 2 as well as Tj 2 (resp. Tj 2 as well as Tj 3 ) using a single vertical line. See 
Figure 3 for an illustration. We note that x-spans (resp. y-spans) of the sets Tj t 
for 1 < t < 2 and 1 < j < m are pairwise disjoint. 

While constructing the points for C\ we make sure that a literal correspond¬ 
ing to Vj is never chosen as the first literal of C\. Note that this does not violate 
any generality, as each clause contains three distinct literals. Later this will help 
us to argue about the number of lines necessary to separate all the points. 




















• 

a 

• • 



• *b 

Xi . 

• 

• 



• 

• Cm 

X 2 . 

• 

• 

d • 


• 

• 

Cm 

•/ 

p. 

q 

m 



• r 

s 

• 


Xi V X 2 v X3 


Fig. 3: The points corresponding to the clause 13 V 12 V X\ and a scheme to 
separate them 


Now we discuss a scheme to separate the points of Tj using horizontal and 
vertical lines. The scheme considers the points in groups of two as defined before. 
At first we will see how to separate the groups from each other and then we will 
separate the points in the individual groups. Note that four horizontal lines are 
sufficient to separate these five groups of points from each other (see Figure 3). 
Also suppose the points in at least one group corresponding to the literals are 
separated by one additional horizontal line. Then two lines are necessary to 
separate the remaining points of Tj and two vertical lines are sufficient for that 
purpose. Hence we have the following observation. 

Observation 5. Suppose the five groups in Tj as defined above are separated 
from each other by four horizontal lines and points in at least one of the groups 
of Tj are separated by a horizontal line, then two more lines are necessary to 
separate all the points ofTj and two vertical lines are sufficient for that purpose. 

Suppose that as a precondition we ensure that the five groups of Tj are 
separated from each other for each 1 < j < m. Furthermore, suppose that 
we have fixed an optimal hitting set type for each Vi for 1 < i < n. If the 
corresponding assignment satisfies the clause Cj , we can argue that the hitting 
sets for the <P(Vi) have a horizontal line that separates the points in at least one 
of the three groups of Tj . Then using Observation 5 two additional vertical lines 
suffice to separate the points in Tj. If the corresponding assignment does not 
satisfy Cj, then the points in none of the three groups of Tj are separated by 
the optimal hitting sets for the T>{Vi). Then at least three more lines ae needed 
to separate the points in Tj. 
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Now we discuss how the preconditions could be satisfied. The three groups 
of Tj can be separated from each other by the horizontal lines which are used 
to separate V. and Vj i+ \ for 1 < i < n. An additional horizontal line would 
be sufficient to separate Tj from Tj for all 1 < j < to. Now if a vertical line 
separates the two groups of Tj from each other, then it also separates the points 
in the middle group of Tj. Thus we cannot argue that two more lines are needed 
to separate the points of Tj. Hence we have to ensure that the two groups of Tj 
are separated using a horizontal line. We will enforce this constraint by a subset 
of auxilliary points. 

Auxilliary points The set of auxilliary points A consists of five point sets {A 1 ,..., A 5 }. 
A 1 and A 2 ensure that Tj 1 and Tj 2 are separated by a horizontal line for 
1 < j < m. A 3 ensures that the points in each Vi are separated using exactly 3 
lines, when 4n — 1 lines are used to separate the points in V. A 4 ensures that 
Tj is separated from T J+1 by a vertical line for each 1 < j < m. A 5 consists of 
four points (6 n + .5, .5), (6n + .5, —.5), (6n + 1.5, .5) and (6n + 1.5, —.5). Note 
that 2 lines are necessary and sufficient to separate these four points. Moreover, 
a horizontal and a vertical line that separate these points also separate T 1 from 
T 2 , and V from T respectively (see Figure 4). 

A 1 is composed of m point sets each of which consists of a translate of the 
four points in F — {(0, —e), (—1, —1 + e), (—2, —2e), (—3, —1 + 2e)}, where e > 0 
is a very small number. A 1 = U jLiAj, where each A] is (—4m + 3 + 4 (j — 

1),—4m + 2 + 4(j — l))-translate of F (see Figure 4). Note that two lines are 
necessary, and one horizontal and one vertical line are sufficient to separate the 
points in each A 4 . By construction, the horizontal line corresponding to Aj also 
separates the two groups of Tj from each other (see Figure 4). 

A 2 is composed of m — 1 point sets each of which consists of a trans¬ 
late of the four points in H = {(—e, 0), (—1 + e, —1), (—2e, —2), (—1 + 2e, —3)}. 

A 2 = U^ 1 A 2 , where each A 2 is (—4m + 4 l, —8m + 7 + 4 (l — l))-translate of H 
(see Figure 4). Note that one horizontal and one vertical line are sufficient to 
separate the points of each A 2 . Moreover, the vertical line corresponding to A 2 
also separates the two groups Aj and Ah, for 1 < l < m — 1. And, any vertical 
line that hits a segment in ‘P(Aj) separates A 2 ^ from A 2 for 2 < l < m — 1. 

A 3 consists of n — 1 point sets each of which is a translate of the points in F. 

A 3 = U jZ-jAj, where A 3 is (—4m — 1 — 4(1 — 1), 7 + 6(1 — l))-translate of F. Thus 
the points in A 3 lie towards north-west of the points in A 1 (see Figure 4). Note 
that one horizontal and one vertical line are sufficient to separate the points in 
each Af. Also the y-span of A 3 is between the y-spans of V/ and V+j for each 
1 < l < n — 1. Thus the horizontal line corresponding to A 3 also separates Vi 
from Vz+i for 1 < l < n — 1. 

A 4 consists of to — 1 point sets each of which is a translate of the points in 
H. A 4 = U™T l 1 A 4 , where A 4 is (—8m + 3 — 4 (j — 1), 6n + 2 + 10j)-translate of 
H. Note that two lines are necessary, and one horizontal and one vertical line 
are sufficient to separate the points in each A 4 . In addition, the ic-span of A 4 is 
between the £-spans of Tj and Tj +1 for each 1 < j < m — 1. Thus the vertical 
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line corresponding to A 4 also separates Tj from Tj + 1 for 1 < j < m — 1 (see 
Figure 4). 

Now consider the collection of sets A] for 1 < l < m, A 2 for 1 < s < m — 1, 
A% for 1 < t < n — 1, A 4 for 1 < j < m — 1, Tj 1 for 1 < j < m and Tj 2 
for 1 < j < in. By construction, x-spans (resp. y-spans) of any two sets in this 
collection are disjoint. Also y-spans (resp. rr-spans) of the sets V t for 1 < i < n 
and Af for 1 < t < n — 1 are pairwise disjoint. Similarly, j/-spans (resp. £-spans) 
of the sets Tj for 1 < j < m and Aj for 1 < j < m — 1 are pairwise disjoint. As 
£-spans (resp. j/-spans) of V and A 1 U A 2 , V and A 4 U A 5 , and V and UjLjTj 
are disjoint, we have the following observation. 

Observation 6. Consider the collection of sets A\ for 1 < t < m, Af for 
1 < l < m — 1, Ag for 1 < s < n — 1, A 4 for 1 < j < m — 1, Vi for 1 < i < n — 1, 
Tji for 1 < j < m, Tj 2 for 1 < j < m and A 5 , x-spans (resp. y-spans) of any 
two sets in this collection are disjoint. 

Now we have the following lemma regarding the properties of the auxilliary 
points. 



Fig. 4: Construction of points corresponding to the clauses {^3 V X 2 V x\,x\ V 
X 2 V £ 3 , X 2 V Xi V 0 ^ 3 } (n — 3 , m — 3). 


Lemma 15. The points of A have the following properties: 

i) 4m — 2 lines are necessary to hit the segments in <P(A 4 U A 2 ). Moreover, 
there is a hitting set for ^(A 1 U A 2 ) consisting of 4 to — 2 lines that also 
separates the two groups of Tj from each other for 1 < j < to. 
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ii) 2(n— 1) lines are necessary and sufficient to hit the segments in UfL] 1 
Hi) 2 ( 771 —1) lines are necessary and sufficient to hit the segments in Ujl] 1 ^>(Aj)) 

Proof, i) Consider any hitting set of lines L for the segments in ^(A 1 U A 2 ). 
At first we show that Ni J (<P(A 1 U A 2 )) > 4to — 2. Note that 7V(^(Aj)) = 
N(<P(A 2 )) = 2 for 1 < j < to, 1 < l < to — 1. Also x-spans and y-spans of 
the sets A] for 1 < j < to and A 2 for 1 < l < m — 1 are pairwise disjoint. 
Hence by Lemma 13, U A 2 )) > Nl(\J'” =1 4>(A 3 ) U ^{A 2 )) = 

ET =1 Nl^A))) + YZi 1 Nl^A 2 )) > ET=i N^A])) + ETJi 1 N{${A 2 )) = 

2 (to + to — 1) = 4to — 2. 

Now we construct a hitting set of lines for ^(A 1 U A 2 ) having at most 4m — 2 
lines. We take one horizontal and one vertical line to separate the points of each 
of the sets A] and A 2 for 1 < j < m, 1 < l < m — 1. We need 4m — 2 lines for 
this. Now the vertical line corresponding to Aj also separates A 1 from Ah 1 for 
1 < l < to — 1. And, the vertical line corresponding to Aj also separates the 
groups A 2 _ 1 and A 2 for 2 < j < m — 1. Thus all the points in A 1 U A 2 are sep¬ 
arated. By construction, the horizontal line corresponding to Aj also separates 
the two groups of Xj 1 from each other, and the result follows. 

ii) First we prove that 2 (n — 1) lines are necessary. Consider any hitting set 
of lines L for the segments in Ut^i 1 ))■ We note that N(<1>(A^)) = 2 for 
1 < t < n — 1. Also i-spans and y-spans of the sets A^ for 1 < t < n — 1 are 
pairwise disjoint. Thus using Lemma 13, 

n— 1 n— 1 n— 1 

N l ( (J <P(A 3 t )) = J2 N L mA 3 t )) > ]T N($(A 3 )) = 2(77 - 1). 

t=i t=i t=i 


To prove the sufficient condition we construct a hitting set of lines for the 
segments in (Jj'Tj 1 <P(A)). For each l<t<ri — lwe take one horizontal and one 
vertical line that separates the points in A). Hence the 2{n— 1) lines in total hit 
all the segments in |J El )■ 

Hi) Consider any hitting set of lines L for the segments in (JjjJi 1 ^(^j))- We 
note that N(d>(A))) = 2 for 1 < j < to — 1. Also x-spans and y-spans of the sets 
Aj for 1 < j < to — 1 are pairwise disjoint. Thus using Lemma 13, 

m —1 m —1 m— 1 

Nl( (J <?(Aj)) = N d^ 3 )) > Y, N(${A a 3 )) = 2 (to - 1). 

l=i l=i l=i 

Now to prove the sufficient condition we construct a hitting set for the seg¬ 
ments in [S)Zi ^(Aj) consisting of 2 to — 2 lines. For each 1 < j < m — 1 we take 
one horizontal and one vertical line that separates the points in Aj. Hence the 

2 (to — 1 ) lines in total hit all the segments in Ujli 1 ^(Aj). □ 
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Now we formally discuss the reduction from 5-OCC-MAX-3SAT to Hitting 
Set Problem. Given a 5-OCC-MAX-3SAT formula we construct the set of 
points V and T corresponding to variables and clauses respectively in the way 
mentioned before. We also construct A, the set of auxilliary points, in the same 
way mentioned before. Let P = V U T U A. Let k = 5n + 8 m — 4. Now we 
show that the reduction has the two properties as claimed before. The following 
lemma ensures the first property. 

Lemma 16. Suppose 4> is satisfiable, then N(<P(P)) < k. 

Proof. Note that it is sufficient to show the existence of a set L of at most 
k horizontal and vertical lines which separate the points of P. The set L is 
constructed in the following way. L is composed of three disjoint sets of lines 
L V ,L C and L a . L V ,L C and L a are the sets of lines which separate the points of 
V, T and A respectively. These lines also separate the groups V, T and A from 
each other. Now we will consider the construction of these sets. 

We add the horizontal line y = 0 and the vertical line x = 6 n + l to L a . These 
two lines divide the plane into four cells. Each cell contains one point of A 5 and 
some additional points. Thus the points in A 5 are separated by these two lines. 
In addition to the points of A 5 the four cells contain the points of four disjoint 
sets A 1 U A 2 , V U A 3 , T 1 U A 4 , and T 2 respectively. Hence the four point sets in 
the four cells are already separated from each other. We show how to separate 
the points in each of these four sets. 

By Lemma 15(i) it follows that there is a hitting set for <?(A 1 UA 2 ) consisting 
of 4m — 2 lines that also separates the two groups of Tj from each other for 
1 < j < m. We add such 4m — 2 lines to L a . Now \L a \ = Am. 

We note that the vertical line correponding to A} separates A 3 from V. By 
Lemma 15 (ii) 2 (n — 1) lines are sufficient to hit the segments in ^(A 3 ). We 
add such a set of 2n — 2 lines to L a . Now \L a \ = Am + 2n — 2. Note that the 
horizontal line corresponding to A 3 separates Vj from Vj+i for 1 < i < n — 1. By 
Observation 4 three lines are sufficient to separate the points of Vi and only two 
configurations of lines are possible for that purpose. For each variable a,’;, if ay is 
1, we select the configuration with the Type 1 horizontal line y = 6 (i — 1) + 4.5 
for Vi. If Xi is 0, we select the configuration with the Type 0 horizontal line y = 
6(i— 1) + 3.5 for V,. We add the 3 lines corresponding to the chosen configuration 
to L v for each Vj. Thus \L V \ = 3 n. Hence the points in V are now separated. 
Also at least one horizontal line has been chosen per V t and thus A 3 is separated 
from A 3 +1 for each 1 < t < n — 2. Thus all the points in V U A 3 are now 
being separated. Note that a vertical line corresponding to V n and a horizontal 
line corresponding to Vi separate V U A 3 from A 4 . Also the same vertical line 
separates A 1 U A 2 from A 4 . 

By Lemma 15(iii) 2(m— 1) lines are sufficient to hit the segments in ^(A 4 ). We 
add such a set of 2m—2 lines to L a . Now \L a \ = 4m+2n—2+2m—2 = 6m+2n—4. 
Note that the vertical line corresponding to A 4 also separates Xj from Tj +1 for 
all 1 < j < m — 1. Also the horizontal line corresponding to A 2 separates A 4 
from Tj. 
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Now consider the points in T. The horizontal lines that separate V) from 
V i+ i for all 1 < i < n — 1, also separate the three groups T 2 X ,T 2 2 and T 2 A from 
each other, for 1 < j < to. Also the hoizontal lines in L a corresponding to A 1 
separate Tj x from Tj 2 for 1 < j < m. Now we claim that for each 1 < j < m 
the points in at least one of the groups of T 2 are being separated by a horizontal 
line in L v . Consider any clause Cj = u\ V U 2 V U 3 . Now as Cj is satisfied there 
must be a satisfied literal of Cj. Without loss of generality let iti be a satisfied 
literal of Cj and it is corresponding to the variable Xi. If u\ is Xi, the points 
in T 2 corresponding to u\ are (6 n + 1 + 10(j — 1) + 1,6(z — 1) + 5 — ej) and 
(6n + 1 + 10(j — 1) + 3, 6 (i — 1) + 4 + ej). As Xi is 1, previously we chose the 
configuration with the Type 1 line for V). Thus these two points are already 
separated. Otherwise, if u\ is X{. the points in TJ corresponding to u\ are (6n + 
1 + 10 {j — 1) + 1,6(z — 1) +4 — ej) and (6?z + 1 + 10(j — 1) + 3, 6(z — 1) + 3 + ej). 
Now as Xi is 0, previously we chose the configuration with the Type 0 line for 
Vi. Thus in this case also the two points are already separated. 

Consider the set of points Tj for any 1 < j < m. All the five groups of 
Tj are separated from each other, and the points in at least one group of T 2 
are separated. Thus by Observation 5 only two vertical lines are sufficient to 
separate the points in Tj. We construct such a set L c containing 2m vertical 
lines corresponding to Tj for all 1 < j < m. As the points in Tj are separated 
for all 1 < j < m, and Tj is separated from T )+1 for all 1 < j < m, all the points 
in T are separated. 

Also the vertical line corresponding to T^ 1 and the horizontal line corre¬ 
sponding to A^ separate A 4 from T 1 . Now the first literal of C\ cannot be 
corresponding to V-\. Thus the vertical line corresponding to Th and a hori¬ 
zontal line corresponding to V\ separate A 4 from T 2 . Lastly, the vertical line 
corresponding to Tj A separate A)_ :l from A j for 2 < j < m — 1. 

We set L = L a U L v U L c and thus the lines of L separate all the points of 
P by our construction. Now \L\ = (6 to + 2n — 4) + 3n + 2 to = 5n + 8 to — 4 = k 
which completes the proof of the lemma. □ 

Now we show that the reduction ensures the second property as well, i.e if less 
than 1 — <5 (0 < 6 < 1) fraction of the clauses of j> are satisfied, then more than 
(1 + gi r 8 )k lines are needed to hit the segments in ^(P). The idea is to show that 
any hitting set of lines for <5(P) that uses exactly 3 lines to seprate the points in 
Vi, uses at least k lines. Thus if at most k + p lines are used to hit all segments 
in $(P), then more than 3 lines can be used for at most p Vjs. Now if 3 lines are 
used to separate the points in Vi, then we can find a binary assignment of the 
variable. Using the exact three lines configurations of at least n — p remaining 
Vi’s we can create an assignment (the values of the other variables are chosen 
arbitrarily) which makes a good fraction of the clauses to be satisfied. Before 
moving on now we have some definitions. 

Consider any hitting set of lines L for ^(P). A variable ais said to be good, 
if the number of lines of L that hit at least one segment of ( I>(Vj) is equal to 3. 
A variable is said to be bad, if it is not good. A clause Cj is said to be bad, if it 
has at least one of the following two properties, (i) any line of L that hits the 
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segment of <P{Tj t ). is vertical, for each 1 < t < 3, and (ii) it contains a literal 
corresponding to a bad variable. The bad clauses which have the first property 
are called bad clauses of first type, and the remaining bad clauses are called bad 
clauses of second type. A clause is said to be good, if it is not bad. Now we have 
the following lemma. 


Lemma 17. Consider any hitting set L for <P(P) consisting of at most k + p 
lines. The number of bad clauses corresponding to L is at most 5 p. 


Proof. Let a be the number of good variables. Then 


n 

N L (<P(Vi)) > 3a + 4 (n — a) = An — a (14) 

i =1 


As each variable appears in exactly 5 clauses the bad variables can appear in 
at most 5(n — a) clauses. Let b be the number of bad clauses of first type. Then 
the total number of bad clauses is at most 5 (n — a) + b. Now consider a bad clause 
Ci of first type. A vertical line that separate the points in Tf t might not separate 
the points of any of X) 1 and T^ 2 . Let C' be the subset of the bad clauses which 
has one such corresponding vertical line, and denote the cardinality of C' by b\. 
Then each of the three vertical lines, corresponding to any of the remaining b—b\ 
bad clauses C s separate the points in either T' 1 or T] 2 . Now for any clause Cj 
£-spans (resp. y-spans) of the sets Tj A and Tj 2 are disjoint. Thus at least two 
lines are needed to separate the points in Tj 1 and Tj 1 2 . This implies 


£ NlWTU) + £ >2 m + (b- 6r) (15) 

3 =1 3 = 1 


By Observation 6 the sets A\ for 1 < t < m, Af for 1 < l < m — 1, for 
1 < s < n — 1, Aj for 1 < j < m — 1, V) for 1 < i < n — 1, Tj ± for 1 < j < to, 
Tj 2 for 1 < j < to and A 5 have pairwise disjoint x spans and pairwise disjoint 
y-spans. Applying Lemma 13, Equation (14) and Equation (15) we count a lower 
bound of Nl(<P(P)) which is as follows. 
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Nl($(P)) 

m m— 1 n— 1 m— 1 n 

> N l ( J <P(Al) U |J <P(Af) U |J ${A\) U |J P(Af) U |J 0(V;)U 

t =1 Z—1 s=l j=l i —1 

m m 

U^JuU^U^A 5 )) 

j-l 2-1 

m m—1 n—1 m—1 

= 53 jVi(<P(4)) + 53 n l ($(a1)) + 53 N L ($(A 3 s j) + 53 iv L (<i>(A 4 ))+ 

Z—1 Z=1 s=l j=l 

n m m 

53 ^(^m)) + 537 v i (^ 1 )) + 53 ^ i (<i>(i; ] ;2 )) +jv l (<? ( 24 5 ) ) 

i=l 2=1 2=1 

m m— 1 n—1 m—1 

> 53 N($(Ai)) + 53 N(<p(Af)) + 53 jv(«p(^)) + 53 n^a^h 

Z—1 Z=1 s—1 j=l 

(4n - a) + (2m + b - h) + N($(A 5 )) 

m m— 1 n— 1 m—1 

= E 2 +E 2 +E 2 +E 2 + (4n — a) + (2m + b — b\) + 2 

t=i /=i s—1 j=l 

= 2 to + 2to — 2 + 2n — 2 + 2m — 2 + 4n — a + 2m + b — b\ + 2 

= ( 8 m + 5n — 4) + (n + b — a — bi) = k + (n + b — a — b\). 

Now consider any clause Cj in C. There is at least one vertical line lj in L 
that separate the points in one of the groups of T'j , but do not separate the points 
in any of Tj and T 4 2 . Let lj separates the points in T? t for some 1 < t < 3. 
The ^-coordinates of the points in T? t are 6 n + 1 + 10(j — 1) + 3(t — 1) + 1 and 
6 n + 1 + 10(j — 1) + 3(t — 1) + 3. Then the equation of lj is x = c for some 
6 n +1 + 10(j — 1) + 3(t — 1) +1 < c < 6 n +1 + 10(j — 1) + 3 (t — 1) + 3. We claim 
that the vertical line lj is not considered in the previous counting of lower bound 
of Nl(<1>(P)). If not, then one of the groups considered in the previous counting 
must contain two points (p 4 ,p 4 ) and (p^iPy) such that p x < c and p x > c. The 
maximum among the ^-coordinates of the points in A\ A 4 is 6 n + 1.5 < c, which 
is actually a point of A 5 . Thus none of the subsets of A\ A 4 contain such points. 
Also the a;-spans of A 4 and Tj are disjoint for any 1 < t < m — 1. Thus no such 
A 4 contains those points. Similarly, the maximum among the ^-coordinates of 
the points in V is 6 n < c, and thus no Vj contains such points. Now consider any 
Tj 1 for l < j. Then the maximum among the ^-coordinates of the points in TJ 1 is 
6 n+l + 10(Z-l) + 9 = 6 n + l + 10Z-l < 6 n+l + 10(j-l)-l < 6 n + l + 10(j- 
l) + 3(f —1) + 1 < c. Thus none of T t [ and X 4 2 contain such points. Now consider 
any Xj 1 for l > j. Then the minimum among the ^-coordinates of the points in 
T{ is 6 n+l + 10(l-l)+2 > 6 n+l + 10(j + l-l) + 2 > 6 n+l + 10(j-l) + 10 + 2 > 
6 n + 1 + 10(j - 1) + 3(3 - 1) + 6 > 6 n + 1 + 10(j - 1) + 3(f - 1) + 2 > c. Thus 
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in this case also none of T) 1 -, and T^ 2 contain the desired points. But this leads 
to a contradiction and hence lj is not considered in the counting before. 

Now \C'\ = b\ and thus in total there are b\ additional vertical lines in L. So 
Nl(<P(P)) > k + {n + b — a — b\) + bi = fc + n + 6 — a. As the total number of 
lines in L is at most k + p, 


k + n + b — a<k+p 
or, n — a + b < p 

Thus the total number of bad clauses is 5(n — a) + b < 5(n — a + b) < 5 p. □ 

Lemma 18. Suppose at most k + |m (0 < 5 < 1) lines are sufficient to hit 
the segments in &(P), then there is an assignment which satisfy at least 1 — 5 
fraction of the clauses. 

Proof. Consider any hitting set of lines L for <P(P) which uses at most k + | m 
lines. We construct an assignment of the clauses having the desired property. Let 
V 1 be the set of good variables corresponding to L. Then by Observation 4 for 
each Xi £ V 1 the optimal configuration with either a Type 1 line or a Type 0 line 
has been chosen. For each Xi £ V 1 if the configuration with Type 1 line is chosen, 
set Xi to 1. Otherwise, set Xi to 0. For each Xi £ V\F 1 assign an arbitrary binary 
value to Xi. We show that this assignment makes at least (1 — 5)m clauses to be 
satisfied. 

Consider any good clause Cj. All of its literals are corresponding to good 
variables and the points of at most two groups of Tj have been separated us¬ 
ing only vertical lines. Let Tj t be the group whose points are separated by a 
horizontal line. Let u and Xi be the literal and variable corresponding to this 
group. If u is Xi , the y-coordinates of the points in Tj t are 6 (z — 1) + 5 — ej 
and 6 (i — 1) +4 + ej. Thus the horizontal line that separates these two points 
is of Type 1, which implies 1 is assigned to Xi and Cj is true. If u is afj, the 
y-coordinates of the points in T? t are 6 (i — 1) + 4 — ej and 6(i — 1) + 3 + ej. Thus 
the horizontal line that separates the points in T? t is of Type 0. This implies 0 is 
assigned to Xi and in this case also Cj is true. Thus any good clause is satisfied 
by the assignment. From Lemma 17 it implies that the number of good clauses 
is at least m — 5|m = (1 — 5)m , which completes the proof of this lemma. □ 

Using the two equations k = 5n + 8m — 4 and 3m = 5 n we get the following 
inequality. 

m > -'j k (16) 

Combining the Inequality (16) with the contrapositive of Lemma 18 it follows 
that, if less than 1 — 5 fraction of the clauses of <j> are satisfied, then the number 
of lines needed is more than k + A 5k = (1 + A5)/c. Thus our reduction ensures 
both of the properties we claimed. 
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A.3.1 Proof of Theorem 8 


Theorem 8 . Suppose there is a polynomial time algorithm that, given <2>(P) and 
non-negative integers r,c as input to CUHS, 

( 1 ) outputs ‘‘yes”, if there is a set with at most c vertical and r horizontal lines 
that hits the segments in <P(P); and 

(2) outputs “no”, if there is no hitting set for <P(P) using at most (l+£)c vertical 
and (1 + £)r horizontal lines, where £ is the constant in Theorem 6 . 

Then P = NP. 

Proof. Assume that there is such an algorithm A. Then we show that, given an 
instance (P, k) of UHS, it is possible to decide in polynomial time whether it 
can be hit by at most k lines, or any hitting set corresponding to it needs more 
than (1 + £)£; lines. Thus from Theorem 6 it follows that P = NP. 

Given the instance (P, k) of UHS, we call A on P, r, c for each pair of positive 
integers r and c such that r + c = k. If A returns yes for at least one pair, then we 
return “yes” for the instance of UHS. Otherwise, we return “no” for the instance 
of UHS. 

If the instance (P, k) admits a hitting set of size at most k, fix such a hitting 
set L. Let L contains r' horizontal and d vertical lines. Then for r = r' and 
c = k — c' > r', A returns “yes”, and so we return “yes” for the instance of UHS. 
If the instance (P, k) does not admit a hitting set of size at most (1 + £)fc, then 
there is no hitting set for <P(P) with at most (1 + £)?’ horizontal and (1 + £)c 
vertical lines such that r + c = k. Thus A returns “no” on all (r, c) pairs on 
which it is invoked, and so we return “no” for the instance of UHS. □ 

A.3.2 Proof of Theorem 9 

Theorem 9. Suppose there is a polynomial time algorithm that, given S, X m , 
w,h, and e as input to CLADS, 

(1) outputs “yes”, if there is an output layout X' with H(X') < h+e and W (A') < 
w + e; and 

(2) outputs “no”, if there is no output layout X 1 2 with H( A') < (l + £ , )(/i + e) and 
W( A') < (1 + £')(w + £ )> where £' = | and £ is the constant in Theorem 6 . 

Then P = NP. 

Proof. Assume that such an algorithm B does exist. Then we show that, given 
an instance (P, r, c) of CUHS, it is possible to decide in polynomial time whether 
there is a set with at most c vertical and r horizontal lines that hits the segments 
in <?(P), or there is no hitting set for T>{P) using at most (1 + £)c vertical and 
(1 + £)r horizontal lines. By Theorem 8 this is true only if P = NP and hence 
the result follows. 

Given the instance (P, r, c) of CUHS, if r = 0 or c = 0, CUHS can be solved 
in polynomial time. Otherwise, we construct a set of squares S' by taking a unit 
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square centered at p for each point p £ P. We invoke Bon5 = S' ,h = r + l,w = 
c+1 and s = £'. If B returns “yes”, we return “yes”. Otherwise, we return “no”. 

Suppose the instance (P , r, c) does admit a hitting set with at most c vertical 
and r horizontal lines. Then by Lemma 1 there is an output layout A' with 
H(X') < r + 1 + £' and width W (A') < c + 1 + £'. Thus B returns “yes”, and so 
we return “yes” for the instance of CUHS. 

Suppose the instance (P, r, c) does not admit a hitting set with at most (l+£)c 
vertical and (l + £)r horizontal lines. Then by Lemma 4, for any output layout A' 
of the squares in S, either H(X') > (1 + £)r + 1 or W(A') > (1 + £)c+ 1. Now as 
c> 1, (l+£)c+l = c+l + 4£ / c = c + (fc -{-1 + 2£ , c+£ 7 c > c+^ , c+l + 2^ , + ^ = 
(1 + £')( c + 1 + CO- Similarly, as r > 1, (1 + £)r + 1 > (1 + £')(r + 1 + £'). Hence 
either H(X') > (1 + £')(r + 1 + £') or W(A') > (1 + £')(c + 1 + £')• Thus at least 
one of the two conditions W{X') < (1 + C)i w + e ) and P(A') < (1 + £')(h + e) 
is false, and B returns “no”, and so we return “no”. □ 
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